Probabilistic SVM/GMM Classifier for Speaker-Independent Vowel Recognition in Continuous Speech
Abstract
In this paper, we discuss the issues in automatic recognition of vowels in the Persian language. The present work focuses on a new statistical method for recognizing vowels as the basic units of syllables. We first describe a vowel detection system, then briefly discuss how the detected vowels can be fed to the recognition unit. In pattern recognition, the Support Vector Machine (SVM), a discriminative classifier, and the Gaussian mixture model (GMM), a generative classifier, are two of the most popular techniques. Current state-of-the-art systems try to combine them to achieve greater classification power and improve recognition performance. The main idea of this study is to combine probabilistic SVM and traditional GMM pattern classification with speech characteristics such as band-pass energy to achieve a better classification rate. This idea has been formulated analytically and tested on a FarsDat-based vowel recognition system. The results show a considerable increase in recognition accuracy. Tests were carried out with the various proposed vowel recognition algorithms, and their results are compared.
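The abstract does not give the fusion formula, so the following is only a minimal sketch of one common way to combine a probabilistic SVM with per-class GMMs, not the authors' formulation. It assumes scikit-learn and SciPy are available, uses synthetic feature vectors in place of FarsDat vowel features, and fuses the two models with a weighted sum of log posteriors. The function names and the weight `alpha` are hypothetical, and the band-pass energy cue mentioned above is not modeled.

```python
import numpy as np
from scipy.special import logsumexp
from sklearn.mixture import GaussianMixture
from sklearn.svm import SVC


def make_toy_data(n_per_class=100, n_classes=3, dim=4, seed=0):
    """Synthetic stand-in for vowel feature vectors (assumption, not FarsDat)."""
    rng = np.random.default_rng(seed)
    features, labels = [], []
    for c in range(n_classes):
        mean = np.zeros(dim)
        mean[c % dim] = 4.0  # shift one coordinate per class
        features.append(rng.normal(loc=mean, scale=1.0, size=(n_per_class, dim)))
        labels.append(np.full(n_per_class, c))
    return np.vstack(features), np.concatenate(labels)


def fit_models(X, y, n_components=2, seed=0):
    """Train one probabilistic SVM and one GMM per class."""
    # probability=True enables Platt-style calibrated class probabilities.
    svm = SVC(kernel="rbf", probability=True, random_state=seed).fit(X, y)
    gmms = [
        GaussianMixture(n_components=n_components, covariance_type="diag",
                        random_state=seed).fit(X[y == c])
        for c in svm.classes_
    ]
    log_priors = np.log(np.array([np.mean(y == c) for c in svm.classes_]))
    return svm, gmms, log_priors


def fused_posteriors(svm, gmms, log_priors, X, alpha=0.5):
    """Weighted log-linear fusion of SVM and GMM posteriors (hypothetical rule)."""
    log_svm = np.log(np.clip(svm.predict_proba(X), 1e-12, None))
    # GMM log-likelihood per class plus log prior, normalized to a log posterior.
    log_joint = np.column_stack([g.score_samples(X) for g in gmms]) + log_priors
    log_gmm = log_joint - logsumexp(log_joint, axis=1, keepdims=True)
    fused = alpha * log_svm + (1.0 - alpha) * log_gmm
    fused -= logsumexp(fused, axis=1, keepdims=True)  # renormalize rows
    return np.exp(fused)


if __name__ == "__main__":
    X, y = make_toy_data()
    svm, gmms, log_priors = fit_models(X, y)
    posteriors = fused_posteriors(svm, gmms, log_priors, X)
    predictions = svm.classes_[posteriors.argmax(axis=1)]
    print("training accuracy:", np.mean(predictions == y))
```

Setting `alpha=1.0` reduces the sketch to the SVM alone and `alpha=0.0` to the GMM alone, which gives a simple way to compare the fused classifier against each component on held-out data.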
Related articles
A Comparative Study of Gender and Age Classification in Speech Signals
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...
Comparative Study of Speaker Recognition Methods: DTW, GMM and SVM
Speaker recognition is a process in which a person is recognized on the basis of his/her voice signals. The problem of speaker recognition belongs to the much broader scientific and engineering topic of pattern classification. In this paper we provide a brief overview of the evolution of pattern classification techniques used in speaker recognition. We also discuss our proposed process ...
Speaker Adaptation for Support Vector Machine based Word Prominence Detection
In this paper we propose a new speaker adaptation method to improve the detection of prominent words in speech. Prosodic cues are difficult to extract, due to the different features different speakers are using to express, for example prominence. To overcome the problem of variation from the pool of speakers used during training and those encountered during deployment, in speech recognition spe...
State Space Point Distribution Parameter for Support Vector Machine Based Cv Unit Classification
In this paper we extend Support Vector Machines (SVM) for speaker independent Consonant-Vowel (CV) unit classification. Here we adopt the technique known as Decision Directed Acyclic Graph (DDAG), which is used to combine many two class classifiers into multiclass classifier. Using Reconstructed State Space (RSS) based State Space Point Distribution (SSPD) parameters, we obtain an average sp...
Combination of clean and contaminated GMM/SVM for far-field text-independent speaker verification
This paper addresses the problem of speaker verification under reverberant conditions, using only the signal acquired by a single distant microphone. The proposed system combines four different subsystems. Two of them are Gaussian Mixture Model (GMM) based and the other two are Support Vector Machine (SVM) based. The subsystems that use the same type of classifier differ in terms of models: one...
Journal: CoRR
Volume: abs/0812.2411
Publication date: 2008